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(54)Titlc: SYSTEMS FOR THE MASS PRODUCTION OF PROTEINS OR PEPTIDES BY MICROORGANISMS OF THE 
GENUS HUMICOLA 



(57) Abstract 



Expression systems enabling the mass production of 
proteins by microorganisms belonging to the genus 
Humicola , in particular, H, insolens , among all, 
host/vector systems and methods for producing proteins 
with the use of the same. Expression vectors containing the 
regulatory sequences, i.e., the promoters, signal sequences 
and terminators of cellulase NCEl or NCE2 genes 
originating in H. insolens are constructed and utilized. 
The use of these expression vectors makes it possible to 
produce, for example, cellulase NCE4 at a high efficiency 
of about 4.5 kg or more of the product per liter of the 
culture broth of//, insolens . 




Patent provided by Sughrue Mion, PLLC - http://www.sughrue.com 



(5 7) mm 

d ^_^_ n 5*-^-*d^*S^^-*wis 



A L 

AM 

AT 

AU 

A Z 

B A 

BB 

BE 

B F 

BG 

B J 

BR 

BY 

C A 

C F 

CG 

CH 

C I 

CM 

CN 

CU 

C Z 

DE 

DK 



*-x Mr 
y^^-t • 7 r y 

-fy** 
*<y fr— i/ 

x^f x 

K-f ? 
y>^r — ? 



ES x~:^> 
F I 7>r>-7>K 

FR 7^>x 
G A 

GB p 

GE ?A-S?T 
G H 

GM Xf>fT 

GN ^nT 
GR 

HU a>^!I- 

ID OK*^T 

I E T<A>y>\: 

1 L ^ X^a:/p 

IS 7^7>K 

IT >f * y T 

J P a* 

KE *r-T 

KG *A-*x*> 

KZ 



L R 

L s u y k 

LV y hV^T 

MC *-f-=i 

MG -*?-tf*ljA< 

MK -7>r K--riB-x-rf x 

MN *>=f/i, 

MR ^e-y^-r 



MX 

NE — ^ _r 
NL 1iy>? 
NO //l-'Jx- 
N Z = 3. — • 7 K 
PL *-?>K 
P T tfA- h # a- 

f^tefft£&#tie8' Ij>y ,Sughrue MiorK 8lI_S ^5j|^www.sughrue.< 



SG /i^ 

S J xn -7 

SK ^tW 7 47ftfrB 

SL i/X7^ 

SN 

SZ x?^v>h* 

TD ft-K 

TG h-r/ 

TJ *>>*x*> 

TM hA'^y^.x^> 

T R h A> =» 

TT h K . h/^=/ 

UA <>*y4-t 

ug 9#>y 

us *B 

UZ ^x*^***^ 

VN !/YxhtA 

YU -i-.-fX7t*7 

zw -;>/^x 



ii r 

WO 98/03667 



1 



PCT/JP97/02S60 



m m m 

(Humicola insolens ) IC&^T. * tUt^zT^ K***§S3S* fcii 

/ n° ? m&£M-$- z mm, ic ^ # * ^ r t> ifi]± *<sin « i. * c 

Aspergillus nidulans (G. L. Gray, et al. , Gene, 48, 41, 1986). 
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Aspergillus oryzae (T. Christensen. et al. , Bio/Technology, 6. 1419.1988 ), 
Trichoderma reesei(Taina Karhunen. et al. . Mol. Gen. Genet. 241. 515-522. 1993X 
Trichoderma viride (C. Cheng, et al. . Agric. Biol. Chen. . 55.1817.1991 ) mz 

jB^r*aLr*5t). *<D±mai*. mmmiLmtiQi. 0-3. 3 g ait? 

&m.mt#m7 I = 7 -f>7l/>X (Humicola insolens ) fefcftfc* > 

<Hr;b7 - if *«4t* C i t>JDbnT^« (WO 9 1/1 7 2 4 3-t&f8 (# 
^¥5 -5 0 9 2 2 3^«) ) 0 

WP8 -5 6 6 6 3#&«l=IW*tiT^*J;^ic % BWIEifcJRjWrastu it 
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3 

y V L/ > x jcfc if z g * n° * g <Z);*d^g;£&£5£4 L 

U«£Kcfcfttf. Btt*>'<*3^D£Mtt(i8t*<0 1 0-16 
S&sps© «fc ^ ^ > / * ? foil^^li $ ft 6 o 

EI Hi. ^7Xi KpM3- l©#JKil*lfiBI'Z?*-5o 
EI 2 fcU ^7X; K p M 1 4 - 1 <Dftm&f&&m-C& So 
H3lix ^7X; HX^-pMKDO l©»^*t£ElT 1 &£o 
El 4 W\ 7^ X 5 KX* ^-pEGDOl ©f&JPBBmUfiElT-* So 
BI5M^ ^7^; K^^ ^-p I EDO 2®*«JI8»*ifiEI"C ! *5o 

EI 1 IcfBttOJt&Ere^S ft ■£ 75 x = K p M 3 — 1 T?^JH£SI$ J 
M10 9*fctt> FERM BP-5971 Oi«6Flfc:FERM P- 1 4 4 5 9. 
M^fEB : 1 9 9 4^8^30) ©gffc#-l;<Z> i ISSlflifi^^I 
fliSi^ir (B*S^Mm^<(fmmi-l-3) lc3FfE£ftT^* 0 

E12fcf2$g<D^ia^£ft£75x ~ KpMl 4- l-e^KI£gl$ftfc^d»H 
J Ml 0 9^(i> FERM B P - 5 9 7 2 0R«FSE : F E RM P-1458 
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5 . mm&u .• 1994^10^18 b) t tMrnmrn^jim^m 

^(u^k^-pmkdo i-cfcfs&mztittmmjMi 0 9t* 

<*> FERM BP-5 9 7 4 dWFK:FERM P-1 5 7 3 0, 

1 9 9 6^7^ 1 2 b) ©ftrewofeiaimsit^xat^^^x^naRa 

*^K£SfgM-<^-pEGDO l-eiefei^tl^IiJMl 0 9# 
FERM BP-5 9 7 3 (M^Fie: FERM P-1 5 72 9, R»KB : 
1996^7^12B) ©§f£#^©fc<i:il^m^XS^^^X^xm^ 

*^lcJ;^^K^^-p i ED02TiI^tlfc«jMl 0 9** 
tt. FERM BP-5 9 7 5 OMFK: FERM P-l 5 7 3 U I«ffig : 

1996^7^123) ^>^m^<Dhtmmmm^xmmmm^j:^xm^ 

*»«k^K^-pNCE4Sa iTKKfiK^^IUMl 0 
'9*tt> FERM BP-5976 (M^fE : FERM P- 1 5 7 3 2. m%r 
KH:1 9 9 6^12H) ©SKM-© *> <fc»ffl!WI*XJKailSK^^X¥ 

2 0 0- 1 |;U FERM BP-5977 CKffi : FERM P - 1 5 7 3 6 N 

m*kb : 1 9 9 6^7^ 1 5 b) a&»^b£affigOTxiHSH*^* 
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L < Ji 2r t fc JiM5^»^o#Un*<tt £ ft*: t> © 5 0 

*mmz&zmmE&\ti*^ l < «#§a^8 - 5 6 6 6 3 #^«icie«o 

7;3-7-^>Vb >X*f©t;b7-^N C E 1 M&frDMmW&k 3 b 1= 
4#M¥8 - 1 2 6 4 9 2^IB»ciB*8©7 ;J-7 - -f>VU>x4^0^7 
— tfNCE 2it^?©*J»E^jT*-56 £t> J*#e*J«cf*» FERM BP-59 
7 lfc^O'FERM BP-5 9 7 2 © h £^ffE£ ftT^ Z>mmtp<D75 x ^ K 
pM3 - 1 fc^O'^^X $ KpMl 4 - NC E 1 *>«fctf2 OftJfflE' 

*HMaic*5^T»* U>"7n*- ^-Be^J©^"Ji LT(i> El 1 OifeElT^ $ ft 
5^7X; KpM3-l+<0, N C E 1 ilfe^ON^^ $>±3fc©$J 1 5 0 0 b 
p *-e©fi«*{c#^-r-5EJik i^lSWON C E 1 &fc?<D N5fcS£*>&±&IE 
<DB g 1 Ili^-f h^TOE^Wbtl5 0 

mT-mtsnZ-J^Xi KpM14-lc£©> NCE2it£^N;fci{&;&>t>±afeCD 
£jl 5 0 0 b p tT-Ofi«*{c#^-r5iS^K #];itfEI#©N C E 2it&?©N 

wya*-^-stt*»«fr**©aaaa^«j*>#*ft«o ^ishsk&^t, ^ 
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2. Og. $?£L<(44. 0 g> <k<0ft£ L< (44. 5gONCE4©^I^ 

mt z -f o * - * -sitmt « t> © 4 1 ho WRt & mmmizimoto^ 

FERM BP-5 9 7 1*J:CFFERM BP-5 9 7 2©tiWe*fttO 

sin** *J:owi*fctt2©«ffl^jL&nfc^iit«T»*ntf. ^^9^ 

1 * tut 2 O^T-AEWflfStiS, =t t) J^ciiBAjff i ,c3H«© T 
S JWmm- 22K-1 =1- Kt-*ifl»^lJ x fccka'SE^-f- 

tt*»JW- ^7 ; y H£fi|« = - * fc© *>£2^2 ft* 0 c © «t 3 ttQkXETiJ 
(wOC>rt>. ^ISt-^^fi^JidlBig©^^ FERM BP- 5 9 7 1 *><fcCfF 
ERM BP-5 9 720t>i^nT^5ffi*, *5 «fc 1 £ fc (4 2 (Dim 

CE l*fc(4 2©N5fc#{M©^<oj^©T$ y^#Jn$nTt>«k^Ci(4^ 

NCEllfcli20N*MJ©^<o^7 K4 
©ftte* ?f»i:liN C E llfcttNC E 2 LT 

^^^^^ KpM3-l*0, NCE l»e?©C3|a*3^&T»©»l 4 
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0 0 b p*-e©««*lc#as-*-<5E*k WitfNCE 1 itfe^O C t> T«E 
OBg 1 Il-th-f h*-t?©E3W$*tffc>n<5o SfJcD^Ji Ltli, IH2©it6gI 

KpM14-l*®> NCE 2&B=?<DC3Z%SfrZ>TM<Dm 

5 o o b p n-ctotimtpiz&tE't&wffiL mz-iiMz-rtu cei csta* 

Ctlt>$iJ^ie^k i^ltNC E 1 tizltNC E 2 © 7 n * - * HE?<J M\ N 
fc<9 2. Ogs KftL<li4. 0 g> L < (i$J4. 5gKii*-*o 
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- * -DV£I^> tc^oTUi ±15 L J: « ^ ^ 
tt, »5»Ji:fc^ii^/:ii^n - pMKD 0 K pEGDO 

■*WPff*tt^ pUC Vector. pTV Vector. pBluescript . pBR3 

^>*2-ffl^5J#£-. Streptomyces rimofaciens&5fc©-x-'x h 7>f 
^ Escherichia coliE&*(ZW v-f •> >B »tt*fir^ Streptococcus hi 
ndustanus U*-^ 5, >Htt»fc? % Streptomyces hygroscopicus** 

^W*LOliiatlll &»O^TttteJ:^ Aspergillus nidulansi 

Gen. Genet. 199:37-45.1985) Ctl*fflV^T. ^h^i/>fitt 

(#§§BS59-175889 ) *3SHRT»= Lfc*Hr v h^JStiO^u^ 

Patent provided by Sughrue Mion, PLLC - http://www.sughrue.com 



WO 98/03667 



9 



PCT/JP97/02560 



U7-T— tr\ 7 ^ - -tf #g Ji±^ ft ^ > / ■? ? |f £ =J - Kf £ it^pa<3Stf ^ 

t-^^^Ji LT7 ^ rr-7 • -<> V l> >X£fiJJE-T£ 0 
NCE 4igfe? 

IT fc * IT* o T> ftjRiLW7-^NCE4iBIi 
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*-pMKDO 1. pEGDO L ttzitp I ED0 2Wb^ o 

^mm^nr^^^m^^zt^^^ #^8 - 5 6 6 6 3^ 

*<2i2«?£l h;U^J92. Og, ^Ui i4 . Og, i:^H<li 
09*.^ §^>/^f^t;l/7-tr > NCE3^tiiNCE4t-*5Ji^ Z 
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If©^ m*.& 1 5 °C~ 4 5 °C. 02 L < it 3 5 ^C~- 4 0 T\ igftB^te 2 
4-2 4 0^rffi!Sfi©*#"T?^-5 C <fcj6<T?*<5 0 
*^i:<koTit>ft5^ K©i$S^t,©|5jJK{c*> 

*^^OT©HiSfeM^j: oT»iHtcttB^-s^ ^mmnztuhmmm^m 

7i3-7-f>7l/>XMN200-l^ (N) JgH£ (5. 0%7tT-b;k 
2. 0 »x+x, 0. l%^U^^h>. 0. 0 3 %i£4t*J J\s ! s * 
0. 0 3%flfE8?v^*^A. P H6. 8)+, 37°CT±g^L^ 0 7BFI^1 
©m> »t>tl^*ift*7 0 0 0 r pmf2 0 #WaWl^ Z> C £ kz J; & 

C©fflfllSB!-fe;l/7 — tfWStt*aDK^ 07^77^ - (Phenyl -SepharoseHi 
gh Performance 16/1 00. 7 7 A7->7^^f^il!) d&U 5 0mM'J>» 

m (pH7. o) 1 -QM(Dmm£im$:frvtzmmT>*-<yi*mi&-z: 

(Phenyl -Sepharose High Performancel6/100 ) (C{&L. 5 OmM'J > 
&3&«&(pH7. 0) <*\ 0. 4-0M©«K4gE*^tj-/t«E»T>*^.^A 
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*<D®&&jm*n-?hy57 4- (Sourcel5 ISO. 7 T i/T'U 

*7-?tm> izmu 5 0mMu>nnm (pH7. o) <k i- 0 mo«ji 

OT**#fcWtT>*-*A^T?*ffiu ^1/Lfc 0 CO?*,. 0M©SS 

M^D7h/77^ - (Source 15 PHE > 7 7 ;l/T i/T'U*r9 &J© ic 
«U 50mM'j>iW( P H7. 0) <K 1 - 0M©jMME*^fc* 

E4iLTmSIL/Co C©NCE4liSDS-PAGEi:^T^fl4 3kD 

(i) N^jgr = jwss&owm 

(*^:RESOURCE «ftg) RPC 3ml, 0. 1%©TF 



o 



dtl**|Sf£«Lfe«t, 4>*©7tacj£#U 8%Gel SDS-PAGE mini (^^rr 
492 (/N°-*>i;U-7-ftisD KttU N£JMr 5 y KE*J« 1 5££Szfc£L 

»e>n^ieyij{i£iToat)-c?*ofco 
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1 3 

N ^ Jffl&n : Ala-Asp-Gly-Lys-Ser-Thr-Arg-Tyr-Trp-Asp-(Cys)-(Cys) 

-Lys-Pro-Ser (15SH) 

(2) W K7ye>^ 

Bute (i) <of PLCizz^xmrnztiiz? >'<?n*m&&mm. ioom 

Mlgt7>€^-7A1^ (pH 8. 0) \zimLtLo ?>'<?WlZttLfo 
l/2 0^H©h'J^->> (fnjtfikgD ZmmL 3 7°CT\ 4 8^HS 

>x;l,vfcb$!D T*7^^D7K77-< -*firO (77 5A.-C8 220x 

2. 1mm. 0. 1%TFA N 0 %7-tr h — h 'J ;U 0 . 0 8 5MTFA, 3 

5%7-feh-h'J;^7^x >h) „ 31®^^f K^BXL/Co n^ntz^ 

TP-l : Tyr-Gly-Gly-Ile-Ser-Ser (6 

TP-2 : Phe-Pro-Asp-Ala-Leu-Lys (6 ^S) 

TP-3 : Phe-Asp-Trp-Phe-Lys-Asn-Ala-Asp-Asn-Pro-Ser-Phe-Ser-Phe-Arg 

>jc >fc ^fc 

mmmt. wo9 1/1 724 3^&fg (#^5-50 922 3 j %&m~) j;:ts 

«c£tlTO£:7 ;3-7"f>Vl/>XDSMl 80 0^£l# t>tl/c 4 3 KD a 

JiIEiE^'J^Protein Identification Resource (PIR) R44. 0. March. 
1995. 2fc(iSWISS-PR0T R31. O.March 1995 lcSfg$nT^£E3flJ<hiWKLrt:<i: 
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M36ffJA3 : yyADNA^-f 7 s ? 'J — cpfptgji 

^VADNA©^H(iHoriuchit,©^ (Hiroyuki Horiuchi et al.. J. Bac 
teriol., 170:272-278,1988 ) \z.'fc->tz a 

£-?\ 7;3-7-r>VU>XMN2 0 0-l« (N) ig*tfic£ N 3 7 ° c 
-e^f«L^o 2 HRi!ig#©& x ^Cvftgf (3 5 0 0 r pnu 1 0#) ia^Tl 

£ £ictf>l j^U^'ij r7_;u (PEG) ftl&ftK J; fr) y 

MDNA^/; 0 

7; 3-7 >yu>xy7ADNA £Sau3A I lz\X^M<tU T 
^D-xy';Hf«ij:j; 0 ^ 9^2 3 k b p e>mmT«m#tf3lzftM2tltzZ 

7-V^??-^ EMBL3 ^ D -^>r+^h Uh^y->y:© ©BamH IT 

-AicT4';#~hr Gft^Httfi) *ffl^-castt**fc 0 

^ TE (10mMhUxtti(pH8. 0) , ImM EDTA)Iftti:# 

ail§S£^©£*£Hohn. B. ©7j& ( Hohn, B. Methods Enzymol. . 68:299- 
309,1979 ) ^nmznT^zmimMltzs<y^-Vj$ft&£tf*Jf;<y 911 

-vU »^tlfc7T-^**»BLE 3 9 2l*(cfijft$-a-A: 0 CO^lcJ: t) 
DNA7*n-7i LT> 7 5 ^-^ . ^>7U>X©^DNA^j|ii:PCR 
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1 5 

JS"*-SDNA*^J5fcLfco fEKLfc^fiW-U =3*7 ? U^f- K©i£?iJ{iOTfC7K$ 

NCE4N1 : 5'-GCXGA(CT)GGXAA(AG)TC(AGCT) AC-3' (17mer) 
NCE4N2:5' GCXGA(CT)GGXAA(AG)AG(CT)AC-3' (17mer) 
NCE4C: 5-'CXGC(AG)TT(CT)TT(AG)AACCA(AG)TC-3' (19mer) 

(X: -fy->» 

N A 1 jtz g in** U "^-7 -f7-iL TNCE4NU NCE4C & 1 // Mfla^/c © N NCE4 
N2, NCE4C J t/MJPX.fcfe002ffl0^j.-y«:f^KL. £*H5»£\ dNTP 
£p£T\ 9 5°C. 5^lffl««tt*?Toyt:o *©&> Taq 0)=*>€ 
±> Maq . ^B3rtfc£D £fln*_, 9 4t:i#[Uk 4 5t2m 7 2 , C3#W 

2 5I§I«IOfi"rcifc«kt)itfiLfeo -e©*Sm> 
NCE4NU NCE4C £ffl^7ti§£©<?K J^j7 5 0 b pODNA ^l^^g cF tiiz 0 Ctl^ 

HWJA5 : t;l/7-«NC E 4iH£?©? p -.=.>?* 
(1) ^-^'W/'J^-f-tf-i'a >J£i;SX^ U-->** 
P CR&ic£^ififI$-t±-7t#j7 5 0b p©DNA#r£-l OOng^ fc&j^l: 
&ECL /YU? h DNA/RNA 7^'J>«->XfA (T v •> + A*t»D {Cck*K 

HlfJ 2 KiZM<DJj'mzm C TftWl L fzy r - 'J 7=7 - >7 \U /s 4 KN+7^ 
^fD>F 7 >X77-^77> (7vyt AfehlS!) \zo-oLt<0^ 0. 4 NtK 
Wtiki~ b U 7A-elt L fcSU 5 ^ftK SSC (15mM?x >K=^- h'J^A> 
1 5 OmMttffc'*- h U 7A) T?ifej£U &m$l±D N AtmfeL tz 0 * -y h©75" 
llffllCT'W^r'J^^-v/a > (4 2t) ©gU 5fc©«8Mb 
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1 6 

hoartticttofco o. 4%sds, 6MjRxm 

0. 5fg^S S Clcfe*) 4 2°CT2 0 2 Slit)«U »C2«| 
S Clcj;»)M-c s 5^M«>ife»*2lsIff ofeo 

(2) 7 7-vDN AOIIS5! 

E. coli LE 3 9 2l:77-y^^^ 8W7 7 -«m 
Grossberger <DJj fe (Grossberger, D. . Nucleic Acids. Res. 15 6737. 1987 ) iz 

(3) gfi<Jjta^iZ)l7-y^o-^>r 

4*077-^DNA*Sal I mU 7^n-7l^i,(^Lfc 0 
DNA*Southern<D;&ft (Southern. E. M. , J. Mol. Biol. 98:503-517. 1975) (C 
«k»). 7-^D>y>y7>lC9oLi!)> Bute (1) 07-7- 7 y ^ -tf 
-fa 5 0 b p©^D-7£ffl(,wwyij^X£i*\ 

5. 2kbp©aWil« t *dtfDNA»rM-*«ffiLfcc ^©^m. 41©7r 
-^DNA^iH]— tMXOSal I WrM"^ LTV^ C 

C©5. 2 k b p O D N A »r^^ 1 7 7 ^7 7/< > 1/ 7 y * 7 h ( 77 ^ 
■7->7Aytr^tffi *JS^T#liU E. coli JM1 0 9^^7*77; K 
PUC1 1 9 ©Sal I Mc*^n-->r*ffofc 0 W&ftfc^XS K 
£pNCE4 S a 1 iLfco 
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mmmA 6 : mmmme>&M 

(1) ffJ ADNA<DtggK?iJft?tFr 

m&WM8ti&kW\U A.L.F. DNA II (7 7 ;bT^7^tf 

Si) &m^ti 0 •/-^■>->>miLm U-xV (7 7 ;^y 

L TA&PH&tZ T ? >)Jl>Ti Kfi#*«fflLfco y^f^BRffl^aaH (N, N, N 1 , 
N' - r- h^y =f-)\>3-=f-vy-J>T l afflKl7>*-*zO t LTlix A. 

L.F ru-Koai (7 7 il/vy7^tf?^) £fflWc 0 *8X15^J»aLR 
3" — h 'J — K — ^ > •> > ^'4=- >y h (7 t i/T'<4 *t ?*±§S0 

7^>^U- hDNA"C*5pNCE4 Sal^ 10// g© 2 MTKK-fk^ h U 
•7A?7;^'JIttLfc»> h 'J - K*>-^>«>>^+y h^©a^/<- 

- -eBM L/:iC^ 546bp ©JfiSE^ja^JHJl L fc Q C <D&m& £ . MNEG01 
ti^?l1cmWy-'r> : yyy^'^^^-^m.L^ pNCE4Sa 1 Ic^fLtK 

lt^^ fee*. NCE4ois«^tit 0 rmttiFiiasm 

is - *r y ; y y f-f'y << -7 - Ji£TRc j^S ft£ S 0 TF£> o fc Q 
MNEG-01: 5 ' -GTGATGAGGGCTGGCGACAGGCC-3 " (23mer) 
MNEG-02: 5'-CTGCCACCTCTATTGCCGGCAGC-3' (23mer) 
MNEG-03: 5'-CCCGACGCCCTCAAGCCCGGCTG-3' (23mer) 
MNEG-04: 5 '-GGCTGGAGCGGCTGCACCACCTG-3 ' (23mer) 
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(2) mmmw&m 

±IE (1) O^Ul^ MNEG-05 -MNEG-08 t£ & FITCUffS is — *r 1/ i/ 1/ f 

Ttc^ $ n a ii t) -3 7t 0 

MNEG-05: 5 "-GACCTGACGGAAGCTGAAGCTCG-3 ' (23mer) 
MNEG-06 : 5 ' -AGCAGTGCAGCCGCTGGGAGTCG-3 ' (23mer) 
MNEG-07: 5 -TGGCAGATGAGGACGTGGTGTTG-3 ' (23mer) 
MNEG-08: 5'-CGCAGCCGCACTTGGCGTCGAAG-3" (23mer) 

:tl^7^7-ipNCE4Sa 1 t(D*~ h U - K->-y >^>^+» y j, 

SallUffJtOrt, 1 2 5 7 bp0iSEW^T§fc o *<Z>IE*JttE3Wt3 fc 

-f > hn^o^j:^ ^ $ 3-7 . ^>vu>XMN2 0 0-lKmRN 
A*«SBU ia*fe^a^(cJ:«9 cDNA^U C tii ^ 7 Avm&Wd&l&it 

. (1) ^RNAOli®! 
7 ' 3 -7"f>Vl/ >XMN 2 0 0 - 1 — (fBWSifi, #2 L < « 

ahb (n ) mmz*s\.*T2 smmmu a**»b#flt o 5 0 0 r Pm . 10 
rr-i? i/ T >m&z&t;m&mmi om 1 (4Mmy>ft>>7> 

2 5mM^x>i=fhtj<>A, 0. 5 %N - 7 * 'J *> h 
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lml©2Mft»t MjCA (pH4. 5) T*?DU 1 0 m I <2> T E Sfrft] 7 ^ 
7 -Jlr&tiQZ-Z z>izffiftltz 0 CtUC 2 m 1 <D9 a n*;L,A— 4 1)17)1=1 
-)\y (2 4:1) £ < I jftC^H ( 3 5 0 0 r p nu 1 0#) 

lOmlO-f V^o/nV -;UT-M®e£2fcjStffcL*r 0 l?!et«*«iC^« (3 5 0 0 
rpnK 10^) LTm&t LTIhIJKU 7 0%x^y 7j<T-S^li^«U- 

- ©CtSS^ 3 . 5 m 1 ©TE 8 8 0 u 1 © 1 0 MSfrfb U 

^Afgjfc&Jn;^ 5°CT-2B#Pe^jg©^ »C.^H (1 2 0 0 0 r pm. 10^) 
<-«fcf?aJ8*®iRL^o ls]ZfcSS;te7 0%JL?; -)lT?m\ Ctl^RNAI^ 
iLfc 0 JK*li2. 7rag, iR^tiO. 14Wo/: c 

(2) ^'JAf^^ + RNA ( = m R N A) <Dm$H 

m R N A <DMMl£^ raRNA fiT^-f y-v'g >+7 h (7 7;l/7->7'< 

*f±S5 (1) -CSWLfc^RNACD?^ lmg^lml©i'Ja-> 3 > 

^< y 7 r - u c tifc 6 5 r> io ^rao^ttAas^jD^. fc 0 

0. 2ml®1r>^A' 7 7 7-^/:o :©M©RNAM^ 
'J n* (dT) t;bn-x*7A(:ft- MviF^'-y7 7--e3^ o? 

•y 7 T--C^ftilL^o C©77v A^fp^-2(Ej^<3igL. mRNAi^i L7C Q iR 
if* 19. 2 # 2 o /Co 

(3) c D N ACD&tfx, 

cDNA^Ii, ^-fA-fe-^-cDNA^^y h (7 7^7i/7^tf^ 
5 g©mRNA^2 0 u 1 CD-* >7;b'< >y 7 7-Ki%mLtz 0 6 5°C. 
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*i»*J:fft'Jtf (dT) ^7^7-i*(c^L, 3 7'Cei*im&Zlttz 0 
C©£;ft£-t?*7> h7> K? v ?Xizlm?L, 1 2°C> 3 0&m> 

(4) W-H?NCE4 cDNACDPCR&fc: 
^LfccDNAO^U g*§HSiU gWCcDNAO^PCR-Sl: 

NCE4-CN: 5' -ATGCGTTCCTCCCCTCTCCTCCGCTCCGCC-3' (30mer) 
NCE4-CC: 5' -TACAGGCACTGATGGTACCAGTCATTAATC-3' (30mer) 

Al^gfc^Ls r^-f ^Mfln*.. dNTP#£T> 9 4°C\ 1 0#FJJ3& 

^^ffofco Taq *ij>7- tf (>;=J>tV> hTaq , ^Mm±M) 

*tDL 9 4tim 5 0 o C2#F*1> 72°C37TFa1coS^#^30HIM0ig 

0. 9kbpO^$T*ofc e Cft*i*y-;l^<fcfcJ:!>*«U P T7 r 

;U_ T ;^-+-;h (y^<s;*>4tl!D IC«fc!>*ci->ffcLfco CO^x 

S K*pCNCE4i L7t 0 
(5) cDNA©»E^J|W 

KpCNQE4*2M*3Mfc*M*AT7;U#'j£teU .r* y- 

■eSJS^-a-yto fufg^&^^-JJNEGOK MNEG02> MNEG03> MNEG04. MNEG05 N 
MNEG06> MNEG07, 43J;CfMNEG08tt btfK* * hflSf*©J.^<— ^;U^-f^_ % 
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<£-©Mm> 5 6b p©— o<ZM > h o >*<#^EL)to E?US#3 ©ETUfcfc^ 

C^ttE*JS^3©E*]ffiM*-C*S) o 
Introne : 4 5 3-4 5 8. 5 0 6-5 0 8> 4 9 1-4 9 7 
Hflfeff] Bl KpMKDOl (Pfm 

(1) ^7X; KpUCll 8BN©fpM 

PUC118 D N A 1 a g£BamH iCcfcoTM. 7 x / -^SlCk 
UX*i(pH8. 0) > ImM EDTA) lIWd?L/: 0 :©DNA^ 

dna ^7>f>/ K SfiiittJID K«koT5^«*sp7t<bLfco 

m ^nrr^zB^^rE. coli competent cells J Ml 0 9 (^BiStiJB!) * 
{^WtteglLfeo ifi^i:o^T, 1 0 0 // g/m 1 ©T> t°v >J >\ 1 m 
M IPTG, 0. 0 0 4% X-ga l£^LB^ig*fc C196*V<?b 
>> 0. 5%8Sx+x, l%NaCU 1. 5 ±T^WRTtl"e> 

a6©3D--^it5t>©*aau :n^ii:> 1 oo#g/micDT>t° 

*>U >£#tTL Bigifi (1%^'J^^h 0. 5%gSx+^, l%NaCl) 

ffll>T:/5Xi KDNA^-lslJRL^o :®77X; KDNA^BamH I»w«fcoTttJ 
#rU 0. 8%T^'a-xy;Um^»U-^Ls pUC118 DNAcDBamHI 
^tiL%:W&l,1z-77* i KD N A %:M1R L tz 0 :©77X; KDNA^pUCl 
1 8 BNiLfco 

(2) 77^; Kp UC 1 1 8 B SNOfPtSf 

PUC118BN DNA 1 u g£Sph I «k oT«U ftui££|5lJ£<D# 
fcfclc«fctK pUCl 18BN OSph I fflHa*tt«Lfc"75* S KDNA^ife, 



Patent provided by Sughrue Mion, PLLC - http://www.sughrue.com 



WO 98/03667 



22 



PCT/JP97/02560 



KDNAfcpUCl 18BSNiL^ 0 
(3) ^7X; KpM2 1©f^8 
(A) t^7-y'NCE 28LfcT<Dmm 

7 * . * > v U - 1 2 6 4 9 2*&*K:K*©#fc 

'Of^-U-^-flWiLT±«Kl. 4Kb, Tli:0. 5Kb©DNASS?ij 
£#1-££§3. 4Kbp<DPst I ~Xba I ftOpUCl 1 8BSN© 

Pst I ~Xba I «ffi|ca*ILfco KDNA*pUCl 1 8BSN 

(b) ^77; Kpuc 1 1 8 b sn-p xcDgK&mmmmt&m 

NC E 2«fe?(0N*Ji©T**J:C«ttih3 K>®tCT«l:, BanH Iffiffi* 
^T©J:^K«ffiJfijaeilfcJ;0»ALfco ^7X; KpUCl 18BSN-P 
Xfcj^E. coli J Ml 0 9**3BKIElftLs $6KW-7t-^M13 
K0 7«»4*fct, T>t"->.j> N f>**tl*tll 5 0 t* g/m 

K 7 O0S/m-l©«£-e^t;3Oml<D % 2 xYTft^Jd! (l. 
hh'J7 B h> > 0. 8%H^*X. 0. 5«NaCl)i:^T, 3 7°CT\ 
16-2 0e«BMMILfco JMJt«Nk*)Ml 3©-*iDNA (s sDNA) * 
tmWLtz 0 COssDNAi2l0^t'J^^U^fK^\ 

MNC-02 5 '-GAGCGCCAGAACTGTGGATCCACTTGGTGAGCAATG-3 ' (36mer) 
MNC-03 5 ' -TCCGCCGTTCTGAGCGGATCCAGGCGTTTGGCGCG-3 ' C35mer) 
WHttBWttiLfcDNAa^ E. coli TGlCfAU m<btltz& 
00// g/m 1 OT>h°i/U >£#t?L BJgifc (l^^U^^fs >> 
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0. 5»x + z, l%NaCl) fCTJMU ^X; KDNA*@CLfc 0 

^ KDN A£BamH IKJ; TtfJWr U 0. 
I&IC^U ^7X; KpUC 118 B S N - P X {C r. ©BffBamH iaSftO^AStl 
fzzfyxi KDNA£«$RLfco C©^.* 5 h'DNA*pM21Hfc, 
(4) b'NC E 3flM£?<D*fl§ 

4>ftJ©Humicola grisea &3fc©-tr d tf:t'W K n -7 — -tf itfe-? (de Oliviera 
Alzevedo, M. et al. , J. General Microbiol. , 136:2569-2576. 1990 ) ©E^J*fei 
C> 7 ^ • -f >V l/>74*OtDl:t/^ Ko^--tfite^ (NC E 3) 

^pcRSia^SiL/:o 

(A) 7-VADNA©#|! 

fuieHWJ A 3 <D-fife\z& i97;3-7-f>7U>xMN2 0 0-l O^V 
ADNA^fco 

(B) ■fe;l/5~tfNC E 3itfe^©P C RSKcfc Sigifl 

Humicola grisea i*o-bnt'*A>f Ko 7 —VMBrf-toW&fe k i (c N 7 $ 
3-7-OvU>XONCE Sitfe^P CRi*fCj;OmiStLrto CCNCE 
3^PCRli^; ^7X; K p M 2 1 OBamH V- 1*£<&£>-£ X 

aUST?*5J:-5(C x &:/5-f ^-{cii I: #>BamH Iffl&^iffrJg-eKfrLfco 

^7^7-iL TOT© ck 9 ttE#l©£J5fci- 'J^?^Utf K£fP8!| L fco 
MKA-05: 5 ' -GCCGCCCAGCAGGCGGGATCCCTCACCACCGAGAGG-3 ' (36mer) 
MKA-06 : 5 " -TGATCGTCGAGTCAGGGATCCAGAATTTACAGGCAC-3 ' (36mer) 

P C RSJStiLA PCR Kit Ver. 2 CSffi&ttlg) £lTO^J*Wofc 0 
£-f\ mft<OJ5&\z£iX'&Z>tlZ'7 ;3-5"T>V b>X7"V ADNA1 ^ 
glwjtfU ^5^T-^1 M N 4 0 0 dNTP . LA Taq+° U ^ "y-M 2. 
5U*Hd^ 9 4°C1#RS. 5 5°C2#HU 7 2 °C 3 ^BB©RiB*f** 3 0 Sit 

vm-tztic&iommLtzo 0. 8%T*fa-xyjiwm.mhv>i&^ 1. 6k 
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n^ P T7 t^-t-k**- + „ h (y,<s;*>tt» »=aastfco :<o^7 

5 KDNA*pK2 1 i L*z 0 

(5) 7*77; KpKM0 4Of^K 

^7;KpK2 1DNA*BartIi:J:,T^bU 1. BKbpODNA* 
tf-fclsUKLfco SHC. ^Xi- KpM2 lDNA^BamH I(c «fc -o TM-ffc U £ £ 

c7otTio»w*Hu nmgmzxmzitizo :nwft#o7**>j 

»T^n-xy^««c«itej;i}driU 5. 2KbpODNA»r^*|gMKLfeo 
PK21ME01. 6Kb P ODNAKffripM2i**©5. 2KbpODN 
A»f^iiL> 7^7X; KpKM0 4^||fc o 
(6) -f^^i KpMKDO 1 (DftWl 
£-?\ AKKD^mz^Qmbtl £ Aspergillus nidulans^^trp C i&B=P(DZf 
o*-*-*,fcOf*-$*-*- (Mullaney. E.J. et al. , M61. Gen. Genet. 
199:37-45.1985 ) *ffll*T\ #§SBS 5 9 - 1 7 5 8 8 9 #&tffi:K**ftTO 

^flSSLfco Ctl^> 7^7X; KpKM0 4OXba I S&fftlcSIA U 75 X~ 
KpMKD 0 1 *fBKLjto 

H»|B2 : KpMKDOIIUS?; 3-7 . 4 > y u > XCOmW^ 

(i) kpmkdo lnmmBmrnm&vwm 

-75 KpMKDO 1*7 ;3-7 • >f > V U >X{c#Af 5^36{c, 
pMKD0 1©^iSiSJ»iSJLfc o pMKDOIH coli J Ml 0 9 
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ClAU lOO/ig/ml07>t s i/'J>^lOOml L Bigififcfc^ 
T— tt> 3 7°C-e^Lfc 0 Ht.n^$^7U+^U7^ 4->yh (7r 
;U^;>7'<-Y*7^J±§!D £JB^T*Sa!! U 1 // g/// 1 ©pMKDO 1 -f^T, 
^ KDNA*»fco 

(2) 7;3-7^>VL/>^©Jgf|£» 

7;n-7'^>Vl/>XMN2 0 0-l^ (S) igitfe* 3 7°C-?igSU 
2 4^rWSt> 3 0 0 0 r pm, 1 Oftffi&bftmizk «9*ML^ 0 (S) 
igltfioMl^ (N) igifeK^n-X (3. 0%) £flD*.s ^7t"t 

0. 4 5 //m07<( -;l/*-T?itiftLrt::/n h^x bfcW&mfe (5mg/m 1 
Novozyme 234 (N L I fcfcSSD > 5mg/m 1 Cellulase Onozuka R-10 (ir? 
JlhttM) . 0. 5M ->i-^ o-X) lOmlClILto 3 0°CT'6 0~ 
9 0^MS»Lx"B**^a bf^T, Mb2-fcf-fc 0 LfcllL 
250 0rpm. 1 0 #l!»k#llt LT^o h 75 X h fcHUK U SUTCII 
(0. 5M->a-^o-^ lOmMM^^'^^A, lOmMh'JXlS 
(pH7. 5) ) -eft^Lfco 

(D10 0//1 fc*f L lO^gODNA (TE) ^ (10^/1) ^:JD^. 
5^Ph1»I1 Lfco o ^'{-> 4 0 0 MOPE G»f& (60% PEG400CU 
10mM»^->7A, lOmMh'J^ii (pH7. 5) ) £ftl;t. tK^jc 
2 0^PI»ilfc^ 1 0 m l©SUTCjS«^«rjD^ 2 5 0 0 r p nu 10 
^F^C^HILrro m#>tz7n ^7°57 h£lm lOSUTCMCliLfc 
4 0 0 0 r pmT»5 3f B 1£>fr#mLT> 1 0 0// lOSUTCtff 

ia±Otoa^j!JD^^^a h^X h£> 2 0 0 // g/m 1 tDM/o7^» 
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l%mdZ (pH6. 8) ) ±ic. YMGDc^ii^JcS^L. 3 7tf 
(3) pMKDO l»=«k*JBKIE*M5©*«feJ:afSDS-PAGElc:«t«ff 

WE©«fc-5K:/*x$ KpMKD01*753-7 • -f >71/>XMN2 0 0 
-lfc#AU M/DT^»itt^ t ^ 50 ^ L/:o (N) 

^^3 7^5BWJMLfco l#bn^i±^SDS-PAGElU^ 

«fLfcic* N pMKDona«jgmai«i0^5^ o - > , : ^^ NC 

(4) «**^.NCE3©N5ia»T5yieK»©|^JE 

SDS-PAGEOg*^ T^fifgSIL/^ >^g,<> WNCE 3»fi 

£Lfca ^ ^iNCE3was«i^&fli&nfe«ai±aic-3^Ts buis^ 

£S tt t°- * £ Jfctt L fc 0 N C E 3 milflE £ H rmcmu L T ^ * t 3 - * £ 
#1KU mmmttio Ctt*d>»©7kf;:»|?U 8 %Gel SDS-PAGE mini (-r 

?=>i±m) ^KimL^ -ti*roa2i5tt«(A2o^rj*icse-,TPVDFjK 

h3ftfc^mO*?tijU yo 7 Podell. D. N. £ 

(Podell, D. N. etal., Biochem. Biophys. Res. Commun. ,81:176,197 

8 ) \zm\ mmum%&&&fa&Lt: 0 >'*9K&vj*)mu f p 

SCO. 5%4?'Jt*-^foiJK> (^14 0, 0 0 0. S/^tti© /10 
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Pfu \£u?A,9 l yWLTU^Zf+y—e CSffli6tt«D lc«fc OtfttfN^iWS 
£8fe*U TK-eife^ mi£L/Co cni^of'f >->-^>1f- -Model 492 Mfc 

N^^T ^ JW&&\ : Asn-Cys-Gly-Ser-Leu-Thr-Thr-Glu-Arg-His-Pro-Ser 
-Leu-Ser-Trp (15^S) 

C©N3fcM'jT ^ yl?S5^J(i. ^57; KpMKD 0 1 <D*gSK?iJ^ £*£5££ 
tfNC E 2. NCE 3»^^^S©7; 7 &I2?«J £ — % L fc 0 
(5) pMKD0 1i:«t^II5gH*©FPLCJ;5i¥ffi 
fu^Ojifc SDS-PAGETNCE3 0^§£5iJ&<5tI8 $ tl/c 5 ? u - 
i£*±ig££ t>{C^S-r^7ti6JC, F P L C yXfAtJ;i?*7A^DT Y7=y 
7<-&n^tz 0 *©^#(±buSB (4) iP— iLfco NCE3©t-^^M 



gig 



NCE 3£^»* 


7;n-7.^>Vb>ZMN2 0 0-l GR80 


0. 4 6 g 


^^^-■7 •^>VU>XpMKD01 


1. 8 g 



mmm b 3 : ^5 kpegdoi (Dfm 

-f^Xi KpMKDO l£BamH H:<t;otML> £<iK7 OtO.^lliJ; 
o T«4P5»**3fee$ tl\ fl£ V >^{k^S U 8. 2Kb p <£> D N AWi^^ 
Lfz 0 
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4«« i ©^|*fci»c % PCRSKJ:0NCE4*fi?*i|(ILfc o CCDNC 
E4^PCRIM, fyaE^*; KpMKD01O8. 2 K b p ©BaoH I 

m m- c ^ u - a *^*)-a-Tas»-c 15^1:, & BamH 

NCE4-N : 5- 'CCGGTGTTGGCCGGATCCGCTGATGGCAAG-3 ' (30mer) 
NCE4-C : 5 ' -TAAGGCCCTCAAGGATCCCTGCGTCTACAG-3 ' (30mer) 

1 U g izltf U 75^ 400^M dNTP „ Pfu DNA # y > ^ - 

*"Uh7^->->ttSa) 2. 5U*flD* N 9 4tl#Ek 5 5t2m 7 2 
t3#W©KJ6*#*2 5@^«9igt-Ci{cJ;f9 0. 8 K b p (DD N Amfrtm 
$3Ltz 0 CCDO. 8Kb p©DNAWffiv£-lpJJKU C;fv£iiui&P MKD 0 1 © 8. 
2Kbp BamE I Rtf-KittS L fc e KDNA^pEGDO 1 £ Lfc a 

^iMB4 :7 7 X; KpEGDQ lOM 

(1) 7°7X; Kp EGDO 1 Ci57 • l' > V U >XCD?g@$£g| 

Kp E GD 0 1 i:J;57 5 n-5> • -f > V U>XMN 2 0 0 - 1 <D& 

nmmt, mmmB2<Djjmz'&->Tft^ti 0 P e g d o i nmrngmm 

fcS^TOU ljug//z l©pEGD0 1^77? KDNA^fc 0 d©pE 
GDO 0 // lkfflLT, 75a-7->f >VU>XMN2 0 0 - 1 £ 

* (N) «*T?3 7'CTf5BW*»Lfco H6nfcJ«±l*SDS-PAGE 
(cj;0^rL/ciC5. p E G D 0 1 IU ZJ&m&mftcD o % l O^n-xcfe 
V>T. NCE4ii£*ft5^>^f^> «k 0 l 0-1 6^±i»DL 
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(2) *I*g|*.NCE 4©N^jg7; y&HS©fsl5£ 

^L^o *1\ ««U N C E 4IfgS»^^»^fc«l±i* F P L C <>7f 
»JB 2 tm—t Ltzo NC E 4^^«clcfc^T#{CifSDLT^S t°-^^ 

N5j5SgT i VTOiJ 1 : Val-Val-Glu-Glu-Arg-Gln-Asn-Cys-Gly-Ser-Ala-Asp- 
Gly-Lys-Ser -Thr-Arg-Tyr-Trp-Asp (205£X) 

N5|5^T ^ y&IE?'J2 : Asn-(Cys)-Gly-Ser-Ala-Asp-Gly-Lys-Ser-Thr-Arg-Ty 
r-Trp-Asp-(Cys)-(Cys)-Lys-Pro-Ser-(Cys) (20^S) 

c tiz><DNm&QH7 i j mmmt, ^x; kpegdoi ommm.?i\frz>m 

( 3 ) pEGDOl (C «fc Z>teWfcWfc<D F P L C «fc £?Fffi 
fu^O^fC SDS-PAGE7NCE4 ©^dKB^jWHB* tut 5 * o - >© 
*g*±a*3 «iicSS-r^^J6JC> F P L C yXfA-e7)7i7D7 h/77^ 
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2 ^k. 

. NCE4^tt* 

7;3-7.^>vi/>XMN2 0 0-l (mm 0. 28g 

y i 3-7 ; ^>Vl/>XpMKEG 1 4, 5 g 

* : 1 L * 19 ©4iit^ £ Q 

^SfefflJ B5 Kp I EDQ2 (DfeW 

( 1 ) ^77^ Kp I D 0 1 OftM. 

Kp EGDO l£Hind III43«fctfBamH Ii;j;,tMU 7. 2Kb 
pODNAefttf-SrlHlJKL/Co 

#§S¥8 - 5 6 6 3§^i:»^rflbn5 7;n-7-^>v 

^n*-^-*j;oeS/r^UKW*3- Kf **»©DNA*i»*Lfco CON 
CEl^nt-^-fejiofS/r^WEWidtrPCRMftt. fui£?°5X* Kp 
EGDO 1© 7. 2Kbp©Hind III-BamH ISffrlC^T^ 5 £ 9 fC, £.7" 5 
f v~|cfi*^^L:a6Hind III, BamH IS^*£frJ£TRtr L fc e 

PNCE1-N : 5 ' -GTCATGAAGCTTCATTAAGGTACGTATGCAAC-3 ' (32mer) 
PNCE1-C : 5 ' -GGTGATGGATCCGGCCTGCTGGGC AGCG ACGC- 3 ' (32mer) 

NAl //g*fU r^^^-^l /iM, 4 0 OittM dNTP N Pfu DNA 9- 

■tf2. 5u*an^ 9 4tim 55^25^ 7 2t47}n®m#^2 

Hind IILte«ktfBanH IK* oTflMbU 1. 5 K b p ©D N ASf^lUiR L?to 
cn^HuiBp E GD 0 1©7. 2Kb pOHind III-BamH I»fJtJC^L^o C 
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<D-?=7 X; KDNA*p I D 0 1 t Lfz c 
(2) -f^X i KpIEDO 2<Dfm 

^x; Kp i do i£BamH uz£^xffi4tu 7 ov^msmx-mmm^^ 

&7££it. yfflmmLti^ 8. 6 K b pcDDN A#rJt*®J&L*:o 

-fyXi Kp EGDO l*BamH He «£ o Tffljt L tz'&^ NCE4itfc^£ 
^tsO. 8Kb p<DDNAffifr%\s}iRLt: 0 2o©«I^L^ ^7X;Kp 
l'ED02^#/:o 

HMffl] B6 :^77; KpIED02 CDfgg 

(1) ^7X; Kp I ED02i:«k57;3-7 • -f > V V > X ©J£K!E& 
75 7^ Kp I ED02i:J;S7;3-7 • Y>7b>X(DMN2 0 0 - IB 

^»JB 2©7j*{^oTff o/co P I EDO 2 <DiSz£Jgfif§!l 

^SSrlSSlI U l//g//il©pIED0277X; KD N A *?#*: 0 d CD p I 
EDO 2t£&* 10^1 &m LT> 7 $ 3-^ • > V U>XMN 20 0-1* 
BKe&U / D7 ^ ^ >itt*^tiHi^* 5 OtfcH&L/Co Ctlb 
* (N) JgftfiT? 3 7 °CT- 5 HfflmmttZo £ttfcig*±S£ SDS-PAGE 
{-cb^^tlrL/c<bC6. p I E D 0 2 IC £ £ fmfc&W® o % 5 ^ n->{cfc^ 
T> NCE 4£if5£SnS* H>\ «9 5 ~ 1 0 i&ma LT^ 

(2) M^-gi^N c e 4cdn^t i ymmm^m^. 

SDS-PAGEOM^^, N **5!§iL*:;? >/^fC'<> K^NC E 4it£ 
^i*T*6Ci*5ilg-r5A:i6(c: N CCD* >/<^«cdN^T ^ y®?g2?ij*& 
^L/-Co 23% H^JB 2 £|H]^©;£&lcj;oT. ^t*:fc<£t>'N C E 4ffi£5U$ 
^bibtl^i±gi:o^TF PLC •^fA^ll^^^Ai' a^- ^77 
^-*fT><\ NCE40h-7^U #C*£i£*§L*: 0 cn*li>gcD7K{C7§^? 
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N^SgT $ 7 g*£?ij : Gln-Ala-Gly-Ser-Ala-Asp-Gly-Lys-Ser-Thr-Arg-Tyr-Trp 

-Asp-ccys) (ibmm 

*l**A5-*NCEK N C E 41^ 07 ; > W^iHRLfco 

(3) PEGD0 2KiJ:*»maMj|soFPLCJ:*ffF* 

SDS — PAGETTNCE 4©^«S»<WB$nfc 5 9 n- 











^3^ 




■ N C E 4*S&* 




'^>VU>XMN2 0 0- 1 G&ft) 


0. 2 8 g 


7 5 


-f>Vl/>7p IED02 


2. 9 g 
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e n n 

mm^- : i 

E?iJ©g£ : 2 2 8 5 

e^jom : m& 
mam. : ~*m 
btfai?- -.mm** 

E?iJ©S3§ .-Genomic DNA 
&M 

: Humicola insolens 
E?»J©#^ 

W^&Wcftxi^ : s ig peptide 
#£&S : 3 1 0. . 3 7 5 

mWLZ&feLtilsm : E 
W^^^tt^ :mat peptide 
#£feS : 3 7 6 . . 1 8 9 0 

: E 

W&&m.i-ttt : i n t r o n 
#£&g : 8 8 0. . 9 3 6 

«r**ftJ£Lfc2ra: : E 
ftm&m-ti^ : intron 
^SteS :. 1 2 9 0. . 1 3 48 

#»**HeLfc2Ffc: E 
^m^^-riH-^- : i n t r o n 
#3Efeg : 1 7 8 0. . 1863 
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W&.^^tt^r : cleavage-site 
W&tilM : 2 4 0 . . 2 4 5 

fl&Oftfg .Sail 
W&'&^ct IH-f- : cleavage-s i te 
: 6 0 3 . . 6 0 8 

momn s a i i 

f^MS^^-^fcl-f- :cleavage-site 

&&&& : 7 6 0 . . 7 6 5 

ffc^itfB : S a 1 I 

W4k^^rt%^- :cleavage-site 

#&ftfi : 1 152. . 1157 

ffe©tf IS : K p n I 

#^£^-^iH-f- : cleavage-site 
#&feg : 1 2 6 7. . 1272 

ffeCDlf ^ : S a 1 I 

g£?<J 

TCTCCAATAA CGACGAAGCG ACTGTTGGCT GATCAATTAG CTGGCGATGG GTCTGTGGTA 
TGGAACGTCG GCTGAGTCTT CCATCTCCCA CCGTAGACGT GTTCCGCGGA TCAAGGTCTC 
CCGCTCCGTA ACCGCCCAGG TGGCTCGGTT CTTGATGATG GGAAAGGGGC CGACGGCAGT 
ATAAAGAGCC ATGGAAGCAT CCCTCGAGGC CGGAAGGAAA TCTTGCTCAG CCACCCGCAG 
TCGACTTGTC TATCGATCTG AGCAGCAGTT GACCGGTCTT CTCTGTCATC TCAGCAGCAG 
TCTTTCAAG ATG CAG ATC AAG AGC TAC ATC CAG TAC CTG GCC GCG 
Met Gin He Lys Ser Tyr He Gin Tyr Leu Ala Ala 
"20 -15 



60 
120 
180 
240 
300 
345 
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GCT CTG CCG CTC CTG AGC AGC GTC GCT GCC CAG CAG GCC GGC ACC ATC 393 

Ala Leu Pro Leu Leu Ser Ser Val Ala Ala Gin Gin Ala Gly Thr He 

-10 -5 1 5 

ACC GCC GAG AAC CAC CCC AGG ATG ACC TGG AAG AGG TGC TCG GGC CCC 441 

Thr Ala Glu Asn His Pro Arg Met Thr Trp Lys Arg Cys Ser Gly Pro 

10 15 20 

GGC AAC TGC CAG ACC GTG CAG GGC GAG GTC GTC ATC GAC GCC AAC TGG 489 
Gly Asn Cys Gin Thr Val Gin Gly Glu Val Val He Asp Ala Asn Trp 

25 30 35 

CGC TGG CTG CAC AAC AAC GGC CAG AAC TGC TAT GAG GGC AAC AAG TGG 537 
Arg Trp Leu His Asn Asn Gly Gin Asn Cys Tyr Glu Gly Asn Lys Trp 

40 45 50 

ACC AGC CAG TGC AGC TCG GCC ACC GAC TGC GCG CAG AGG TGC GCC CTC 585 
Thr Ser Gin Cys Ser Ser Ala Thr Asp Cys Ala Gin Arg Cys Ala Leu 
55 60 65 70 

GAC GGT GCC AAC TAC CAG TCG ACC TAC GGC GCC TCG ACC AGC GGC GAC 633 
Asp Gly Ala Asn Tyr Gin Ser Thr Tyr Gly Ala Ser Thr Ser Gly Asp 

75 80 85 

TCC CTG ACG CTC AAG TTC GTC ACC AAG CAC GAG TAC GGC ACC AAC ATC 681 
Ser Leu Thr Leu Lys Phe Val Thr Lys His Glu Tyr Gly Thr Asn He 

90 . 95 100 

GGC TCG CGC TTC TAC CTC ATG GCC AAC CAG AAC AAG TAC CAG ATG TTC 729 
Gly Ser Arg Phe Tyr Leu Met Ala Asn Gin Asn Lys Tyr Gin Met Phe 
105 110 115 
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ACC CTG ATG AAC AAC GAG TTC GCC TTC GAT GTC GAC CTC TCC AAG GTT 777 
Thr Leu Met Asn Asn Glu Phe Ala Phe Asp Val Asp Leu Ser Lys Val 

120 125 130 

GAG TGC GGT ATC AAC AGC GCT CTG TAC TTC GTC GCC ATG GAG GAG GAT 825 
Glu Cys Gly He Asn Ser Ala Leu Tyr Phe Val Ala Met Glu Glu Asp 
135 140 145 150 

GGT GGC ATG GCC AGC TAC CCG AGC AAC CGT GCT GGT GCC AAG TAC GGC 873 
Gly Gly Met Ala Ser Tyr Pro Ser Asn Arg Ala Gly Ala Lys Tyr Gly 

155 160 165 

ACG GGC GTACGTTCTC TCCGTCCCGC CCCTACCAAA AGTATGACTC GTGCTGACGT 929 
Thr Gly 

TTG ACAG TAC TGC GAT GCC CAA TGC GCC CGT GAC CTC AAG TTC ATT GGC 978 
Tyr Cys Asp Ala Gin Cys Ala Arg Asp Leu Lys Phe He Gly 
170 175 180 

GGC AAG GCC AAC ATT GAG GGC TGG CGC CCG TCC ACC AAC GAC CCC AAC 1026 
Gly Lys Ala Asn He Glu Gly Trp Arg Pro Ser Thr Asn Asp Pro Asn 

185 190 195 

GCC GGT GTC GGT CCC ATG GGT GCC TGC TGC GCT GAG ATC GAC GTT TGG 1074 
Ala Gly Val Gly Pro Met Gly Ala Cys Cys Ala Glu He Asp Val Trp 

200 205 210 

GAG TCC AAC GCC TAT GCT TAT GCC TTC ACC CCC CAC GCC TGC GGC AGC 1122 
Glu Ser Asn Ala Tyr Ala Tyr Ala Phe Thr Pro His Ala Cys Gly Ser 
215 220 225 230 
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AAG AAC CGC TAC CAC ATC TGC GAG ACC AAC AAC TGC GGT GGT ACC TAC 1170 
Lys Asn Arg Tyr His He Cys Glu Thr Asn Asn Cys Gly Gly Thr Tyr 

235 240 245 

TCG GAT GAC CGC TTC GCC GGC TAC TGC GAC GCC AAC GGC TGC GAC TAC 1218 
Ser Asp Asp Arg Phe Ala Gly Tyr Cys Asp Ala Asn Gly Cys Asp Tyr 

250 255 260 

AAC CCC TAC CGC ATG GGC AAC AAG GAC TTC TAT GGC AAG GGC AAG ACC 1266 
Asn Pro Tyr Arg Met Gly Asn Lys Asp Phe Tyr Gly Lys Gly Lys Thr 

265 270 275 

GTC GAC ACC AAC CGC AAG TTC AC GTAAGTTCCC TGGCCGCCTC TTCGACGACG CAG 1322 
Val Asp Thr Asn Arg Lys Phe Th 

280 285 
AATGTCCGGA TGCTGACCCA GAACAG C GTT GTC TCC CGC TTC GAG CGT AAC AGG 1376 

r Val Val Ser Arg Phe Glu Arg Asn Arg 
290 295 
CTC TCT CAG TTC TTC GTC CAG GAC GGC CGC AAG ATC GAG GTG CCC CCT 1424 
Leu Ser Gin Phe Phe Val Gin Asp Gly Arg Lys He Glu Val Pro Pro 

300 305 310 

CCG ACC TGG CCC GGC CTC CCG AAC AGC GCC GAC ATC ACC CCT GAG CTC 1472 
Pro Thr Trp Pro Gly Leu Pro Asn Ser Ala Asp lie Thr Pro Glu Leu 

315 320 325 

TGC GAT GCT CAG TTC CGC GTC TTC GAT GAC CGC AAC CGC TTC GCC GAG 1520 
Cys Asp Ala Gin Phe Arg Val Phe Asp Asp Arg Asn Arg Phe Ala Glu 
330 335 340 
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ACC GGT GGC TTC GAT GCT CTG AAC GAG GCC CTC ACC ATT CCC ATG GTC 1568 
Thr Gly Gly Phe Asp Ala Leu Asn Glu Ala Leu Thr He Pro Met Val 

345 350 355 

CTT GTC ATG TCC ATC TGG GAT GAC GTATGTGGCA CCAACCTCCA ACCGGGCATG AG 1624 
Leu Val Met Ser He Trp Asp Asp 
360 365 

ACCTGTACTG ACGTGTCTTG ACAG CAC CAC TCC AAC ATG CTC TGG CTC GAC TCC 1678 

His His Ser Asn Met Leu Trp Leu Asp Ser 
370 375 
AGC TAC CCG CCC GAG AAG GCC GGC CTC CCC GGT GGC GAC CGT GGC CCG 1726 
Ser Tyr Pro Pro Glu Lys Ala Gly Leu Pro Gly Gly Asp Arg Gly Pro 

380 385 390 

TGC CCG ACC ACC TCT GGT GTC CCT GCC GAG GTC GAG GCT CAG TAC CCC 1774 
Cys Pro Thr Thr Ser Gly Val Pro Ala Glu Val Glu Ala Gin Tyr Pro 
395 400 405 

AAT GC GTACGTTACT ACCGCCGCTG CATCTGCAAA AAATACCGGT GCTAACCATT GTG 1832 
Asn Al 

410 

CAG T CAG GTC GTC TGG TCC AAC ATC CGC TTC GGC CCC ATC GGC TCG ACC 1881 
a Gin Val Val Trp Ser Asn He Arg Phe Gly Pro He Gly Ser Thr 
415 420 425 

GTC AAC GTC TAAGCTATCA CGGCTCAAAA TCAGCGCCCG CTCTGCTCGT CCTGTTCGGC 1940 
Val Asn Val 

GCGCCAGTAG GGGGATATGG GGCATTTCTT TGTTCAAGCA TTTTTCTCTT CGTCCTGCTA 2000 
CATATTGAGA TTGTGTATCG TATGCACGCG TACAAAGTAG AAACCATGAT CAAGTCTCAT 2060 
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TGAACTATAC TGCTGCTCCC AAGATTAATT ATGCCGTAAT GGTCTGTTTG CTTTTTTTTT 2120 

TTTTTTTTTT TGGTGCACTT GATCGTGTGG CACATTGGCC GCTGTATGTA TGGCTTCCCT 2180 

CAATCGCCGA CTGACTCAAA ACGGCAGTAC AACAGAAGCC CCATTGCATC AGAAGAGAGG 2240 

TTTTATAATG CCATGAGGTG TTCTCAGATG AAAGACTTCG AGTAT 2285 
K?iJ#-t : 2 
g£?ij©:g£ : 2 4 0 9 

mwcow. : mm 

WM<OW& - Genomic DN A 

mm 

'■ Humicola insolens 

: s i g peptide 
?f SttS : 3 8 9 . . 4 5 7 

ftrnzmmtttm: e 

%fWL&&tfc J % : m a t peptide 

: 4 5 8. . 2 0 9 8 
fflk*&mtti15m : E 
W&Zm-fsZ^ : i n t r o n 

: 4 7 8. . 5 3 5 
ftft^Lfc^: E 
4**£S"*-!» : i n t r o n 

: 1 0 3 0. . 1141 

<ftmz&mitii5& : E 
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i n t r o n 
: 1 7 6 2. . 1 8 1 5 

*mz&mitt& : E 

i n t r o n 
: 1 9 9 0. . 2 0 4 4 

Lfc^rft: E 

eavege — s i te 
#£ffiS : 6 8 8 . . 6 9 3 
ffe©tf m. : S m a I 

W&^^k'ftim- :cleavege-site 
##ffig : 1 2 5 3. . 1 2 5 9 
flfc©ttf8 : BaraHI 

#afc3r&"$"iE-^- : cleavege-si te 
#£ftfi : 1 5 0 5. . 1 5 1 0 
ffeOflWR : B g 1 II 

#^^-^t"f5-f- : cleavege-si te 
#£ftg : 1 6 4 3. . 1648 

momm :stui 

TGCTGGACCT TGGATGCGTC TGCCGAGCTG TGCGTGCGGA AGAGTCGAGC GTGATTCCGG 60 

CATCACTGAA CACTCGCTGG TTGCTGGTTC TGGAAGCGGT ACGTCCGGCG CAAACCAGCA 120 

AAAGCAGGTT TGCGCTGCCT TGGCCTCCGT GAGAGGCATG ATGCCAAGGA TGAATGGTTC 180 

CTCTGCGGAC TCAACCATCC GCACTTCGAG CCCGACGATC CGGGCCCCCT GCTCCGGCGC 240 

GGAGAGCCGT GGTGAGCTCC AAGTGATGCG GAATCGGTGA TGTGCAAGAT GCGGAGGGCA 300 

TAAAAAGGCT GTTTCCCACA CGAAGCATTC TCCAGCTTGT TTCCTCACGG CACACGGTCA 360 
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AACAAGTCTG TGCAGTACCT GGGACAAG ATG GCC AAG TTC TTC CTT ACT GCT 412 

Met Ala Lys Phe Phe Leu Thr Ala 
-20 

GCC TTT GCG GCT GCC GCT CTC GCC GCT CCC GTT GTT GAG GAG CGC CAG 460 
Ala Phe Ala Ala Ala Ala Leu Ala Ala Pro Val Val Glu Glu Arg Gin 
-15 -10 -5 1 

AAC TGT GCC CCG ACT TG GTGAGCAATG GTGTTTCATG GATCGTGTCT TTGGATGTGC 517 
Asn Cys Ala Pro Thr Tr 
5 

GGCTAACAAC CATTCCAG G GGC CAG TGC GGT GGC ATC GGC TIC AAT GGC 566 

p Gly Gin Cys Gly Gly He Gly Phe Asn Gly 
10 15 
CCG ACT TGC TGC CAG TCT GGT AGC ACC TGC GTG AAG CAG AAC GAC TGG 614 
Pro Thr Cys Cys Gin Ser Gly Ser Thr Cys Val Lys Gin Asn Asp Trp 

20 25 30 

TAC TCC CAG TGC TTG CCC GGT AGC CAG GTC ACC ACG ACC TCG ACT ACG 662 
Tyr Ser Gin Cys Leu Pro Gly Ser Gin Val Thr Thr Thr Ser Thr Thr 

35 40 45 

TCG ACT TCG AGC TCG TCG ACC ACC TCC CGG GCC ACC TCG ACC ACC AGG 710 
Ser Thr Ser Ser Ser Ser Thr Thr Ser Arg Ala Thr Ser Thr Thr Arg 
50 55 60 65 

ACC GGT GGT GTG ACC TCG ATC ACC ACT GCT CCC ACC CGC ACC GTC ACC 758 
Thr Gly Gly Val Thr Ser He Thr Thr Ala Pro Thr Arg Thr Val Thr 
70 75 80 
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806 



902 



950 



ATC CCT GGC GGT GCC ACC ACC ACG GCC AGC TAC AAC GGC AAC CCC TTC 
lie Pro Gly Gly Ala Thr Thr Thr Ala Ser Tyr Asn Gly Asn Pro Phe 

85 90 95 

GAG GGT GTC CAG CTC TGG GCC AAC AAC TAC TAC CGC TCT GAG GTC CAC 854 
Glu Gly Val Gin Leu Trp Ala Asn Asn Tyr Tyr Arg Ser Glu Val His 

100 105 110 

ACC CTC GCC ATT CCT CAG ATC ACC GAC CCT GCC TTG AGG GCT GCG GCC 
Thr Leu Ala He Pro Gin He Thr Asp Pro Ala Leu Arg Ala Ala Ala 

115 120 125 

TCG GCC GTC GCT GAG GTC CCG AGC TTC CAG TGG CTC GAC CGC AAC GTC 
Ser Ala Val Ala Glu Val Pro Ser Phe Gin Trp Leu Asp Arg Asn Val 
130 135 140 145 

ACG GTC GAC ACC CTG CTC GTC GAG ACC CTC TCT GAG ATC CGC GCC GCG 
Thr Val Asp Thr Leu Leu Val Glu Thr Leu Ser Glu He Arg Ala Ala 

150 155 160 

AAC CAG GCG GGC GCG AAC CCC CCG TAT GCC G GTAAGTGCGG TGTCACCACC 1049 
Asn Gin Ala Gly Ala Asn Pro Pro Tyr Ala A 

165 170 
ACCAACCCTA ACCCTGACCC CTGACCACCA CATCATCAAC ATCACCACAC ATCTCCCACA 1109 
TCATTCTGGA CGCAAATTAA CGCCAAATCC AG CC CAG ATC GTC GTT TAC GAC 1161 

la Gin He Val Val Tyr Asp 
175 

CTT CCT GAC CGC GAC TGC GCT GCC GCG GCT TCG AAC GGC GAG TGG GCG 1209 
Leu Pro Asp Arg Asp Cys Ala Ala Ala Ala Ser Asn Gly Glu Trp Ala 
180 185 190 



998 
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ATC GCC AAC AAC GGC GCC AAC AAC TAC AAG GGA TAC ATC AAC CGG ATC 1257 

He Ala Asn Asn Gly Ala Asn Asn Tyr Lys Gly Tyr He Asn Arg He 

195 200 205 210 

CGC GAG ATT CTC ATT TCG TTC TCG GAT GTC CGC ACG ATT CTG GTT ATC 1305 

Arg Glu He Leu He Ser Phe Ser Asp .Val Arg Thr He Leu Val He 

215 220 225 

GAG CCC GAC TCG CTG GCC AAC ATG GTC ACC AAC ATG AAC GTC GCC AAG 1353 
Glu Pro Asp Ser Leu Ala Asn Met Val Thr Asn Met Asn Val Ala Lys 

230 235 240 

TGC AGC GGT GCC GCC TCG ACC TAC CGC GAG TTG ACC ATC TAT GCC CTC 1401 
Cys Ser Gly Ala Ala Ser Thr Tyr Arg Glu Leu Thr He Tyr Ala Leu 

245 250 255 

AAG CAG CTC GAC CTC CCG CAC GTC GCC ATG TAC ATG GAC GCC GGC CAC 1449 
Lys Gin Leu Asp Leu Pro His Val Ala Met Tyr Met Asp Ala Gly His 

260 265 270 

GCT GGC TGG CTT GGC TGG CCC GCC AAC ATC CAG CCC GCT GCT GAG CTC 1497 
Ala Gly Trp Leu Gly Trp Pro Ala Asn He Gin Pro Ala Ala Glu Leu 
275 280 285 290 

TTC GCC AAG ATC TAC GAG GAT GCC GGC AAG CCC CGC GCC GTC CGC GGT 1545 
Phe Ala Lys He Tyr Glu Asp Ala Gly Lys Pro Arg Ala Val Arg Gly 

295 300 305 

CTC GCC ACC AAC GTC GCC AAC TAC AAC GCC TGG AGC ATC TCG AGC CCG 1593 
Leu Ala Thr Asn Val Ala Asn Tyr Asn Ala Trp Ser He Ser Ser Pro 
310 315 320 
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CCG CCG TAC ACC AGC CCC AAC CCC AAC TAC GAC GAG AAG CAC TAC ATC 1641 
Pro Pro Tyr Thr Ser Pro Asn Pro Asn Tyr Asp Glu Lys His Tyr He 

325 330 335 

GAG GCC TTC CGC CCT CTC CTC GAG GCC CGC GGC TTC CCC GCC CAG TTC 1689 
Glu Ala Phe Arg Pro Leu Leu Glu Ala Arg Gly Phe Pro Ala Gin Phe 

340 345 350 

ATC GTC GAC CAG GGC CGC AGC GGC AAG CAG CCC ACC GGC CAG AAG GAA 1737 
He Val Asp Gin Gly Arg Ser Gly Lys Gin Pro Thr Gly Gin Lys Glu 
355 360 365 370 

TGG GGC CAC TGG TGC AAT GCC ATT GTACGTTAAG GTTAGGGTTA CATATTTGCG 1791 
Trp Gly His Trp Cys Asn Ala He 
375 

TTCCCATGAC TAACATCCTT CCAG GGC ACC GGC TTC GGT ATG CGC CCG ACT 1842 

Gly Thr Gly Phe Gly Met Arg Pro Thr 
380 385 
GCC AAC ACC GGC CAC CAG TAC GTC GAC GCC TTC GTC TGG GTC AAG CCC 1890 
Ala Asn Thr Gly His Gin Tyr Val Asp Ala Phe Val Trp Val Lys Pro 

390 395 400 

GGC GGT GAG TGC GAC GGC ACC AGC GAC ACG ACC GCT GCC CGC TAC GAC 1938 
Gly Gly Glu Cys Asp Gly Thr Ser Asp Thr Thr Ala Ala Arg Tyr Asp 

405 410 415 

TAC CAC TGC GGT CTC GAG GAC GCC CTC AAG CCC GCC CCT GAG GCC GGC 1986 
Tyr His Cys Gly Leu Glu Asp Ala Leu Lys Pro Ala Pro Glu Ala Gly 
42 0 425 430 435 
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CAG GTGAGCACCA AACCCGACCA CAACAAGAAA TGTACCAAAG GCTAACCAAC TCCAG 2044 
Gin 

TGG TTC CAA GCC TAC TTT GAG CAA TTA CTT CGT AAT GCC AAT CCG CCG 2092 
Trp Phe Gin Ala Tyr Phe Glu Gin Leu Leu Arg Asn Ala Asn Pro Pro 

440 445 450 

TTC TGA GCGGTTTGAG GCGTTTGGCG CGATGTTGGC GATGTTTAGG ATCAAAAAGG 2148 
Phe *** 

GGGGGAAAAG GCGAAAAGGG GCCGGTCCGG GAGGCCCCAC AATATCGGCC CCACCCTCCG 2208 
ATCACGTGCT CCCCGCATCG GCACAGACGT CGCTTAATGC ATTGAGGGGG TTGACAAAAT 2268 
TCAAGTCTTC TTCTGTAAAT AGTTGGCATC TGCCATTGTT GGACAAGATT TAGTCTTTCG 2328 
AGTATATACA CTTTGTTCCA ACGGGGTCTA GTAACTTCCG AGGTCATCTC ATCAAGCATT 2388 
GTTTGAGTCT CGCGTTTATA C 2409 

w&m^ : 3 

IE?U©^£ : 1 2 5 7 

mnv>w. : mm. 

:Genomic D N A 

mm 

£j£$9£r : Humicola insolens 

^Wi^^.'ti'd.^r : intron 
#£ftg : 4 5 3 . . 5 0 9 
ftWt.Z&mLtzJj& : E 
SE?'J 
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AATGACGGGG CAACCTCCCG CCCGGGCCCA ACTCTTGGGT TTGGTTTGAC AGGCCGTCTG 60 
TCTCTTGCGT CCTCTTACTA CGCCTGCCTG GACCCTACGT CTCAACTCCG ATTCAAG 117 
ATG CGT TCC TCC CCT CTC CTC CGC TCC GCC GTT GTG GCC GCC CTG CCG 165 
Met Arg Ser Ser Pro Leu Leu Arg Ser Ala Val Val Ala Ala Leu Pro 

"20 -15 -io 

GTG TTG GCC CTT GCC GCT GAT GGC AAG TCC ACC CGC TAC TGG GAC TGC 213 
Vai Leu Ala Leu Ala Ala Asp Gly Lys Ser Thr Arg Tyr Trp Asp Cys 

" 5 1 5 10 

TGC AAG CCT TCG TGC GGC TGG GCC AAG AAG GCT CCC GTG AAC CAG CCT 261 
Cys Lys Pro Ser Cys Gly Trp Ala Lys Lys Ala Pro Val Asn Gin Pro 

15 20 25 

GTC TTC TCC TGC AAC GCC AAC TTC CAG CGT CTC ACT GAC TTC GAC GCC 309 
Val Phe Ser Cys Asn Ala Asn Phe Gin Arg Leu Thr Asp Phe Asp Ala 

30 35 40 

AAG TCC GGC TGC GAG CCG GGC GGT GTC GCC TAC TCG TGC GCC GAC CAG 357 
Lys Ser Gly Cys Glu Pro Gly Gly Val Ala Tyr Ser Cys Ala Asp Gin 

45 50 55 

ACC CCA TGG GCT GTG AAC GAC GAC TTC GCG TTC GGT TTT GCT GCC ACC 405 
Thr Pro Trp Ala Val Asn Asp Asp Phe Ala Phe Gly Phe Ala Ala Thr 
60 65 70 75 

TCT ATT GCC GGC AGC AAT GAG GCG GGC TGG TGC TGC GCC TGC TAC GA 452 
Ser He Ala Gly Ser Asn Glu Ala Gly Trp Cys Cys Ala Cys Tyr Gl 
80 85 90 
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GTAAGCTTTG GTCGCGTGTG TAACACTGTG CAGGCATAGC ACTAACCACC TCCCAG G 509 

u 

CTC ACC TTC ACA TCC GGT CCT GTT GCT GGC AAG AAG ATG GTC GTC CAG 557 
Leu Thr Phe Thr Ser Gly Pro Val Ala Gly Lys Lys Met Val Val Gin 

95 100 105 

TCC ACC AGC ACT GGC GGT GAT CTT GGC AGC AAC CAC TTC GAT CTC AAC 605 
Ser Thr Ser Thr Gly Gly Asp Leu Gly Ser Asn His Phe Asp Leu Asn 

110 115 120 

ATC CCC GGC GGC GGC GTC GGC ATC TTC GAC GGA TGC ACT CCC CAG TTC 653 
He Pro Gly Gly Gly Val Gly He Phe Asp Gly Cys Thr Pro Gin Phe 

125 130 135 

GGC GGT CTG CCC GGC CAG CGC TAC GGC GGC ATC TCG TCC CGC AAC GAG 701 
Gly Gly Leu Pro Gly Gin Arg Tyr Gly Gly He Ser Ser Arg Asn Glu 
140 145 150 

TGC GAT CGG TTC CCC GAC GCC CTC AAG CCC GGC TGC TAC TGG CGC TTC 749 
Cys Asp Arg Phe Pro Asp Ala Leu Lys Pro Gly Cys Tyr Trp Arg Phe 

160 165 170 

GAC TGG TTC AAG AAC GCC GAC AAC CCG AGC TTC AGC TTC CGT CAG GTC 797 
Asp Trp Phe Lys Asn Ala Asp Asn Pro Ser Phe Ser Phe Arg Gin Val 

175 180 185 

CAA TGC CCA GCC GAG CTC GTC GCT CGC ACC GGA TGC CGC CGC AAC GAC 845 
Gin Cys Pro Ala Glu Leu Val Ala Arg Thr Gly Cys Arg Arg Asn Asp 
' 190 195 200 
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GAC GGC AAC TTC CCT GCC GTC CAG ATC CCC TCC AGC AGC ACC AGC TCT 893 
Asp Gly Asn Phe Pro Ala Val Gin He Pro Ser Ser Ser Thr Ser Ser 

205 210 215 

CCG GTC GGC CAG CCT ACC ACT ACC AGC ACC ACC TCC ACC TCC ACC ACC 941 
Pro Val Gly Gin Pro Thr Ser Thr Ser Thr Thr Ser Thr Ser Thr Thr 
220 225 230 2 35 

TCG AGC CCG CCC GTC CAG CCT ACG ACT CCC AGC GGC TGC ACT GCT GAG 989 
Ser Ser Pro Pro Val Gin Pro Thr Thr Pro Ser Gly Cys Thr Ala Glu 

240 245 250 

AGG TGG GCT CAG TGC GGC GGC AAT GGC TGG AGC GGC TGC ACC ACC TGC 1037 
Arg Trp Ala Gin Cys Gly Gly Asn Gly Trp Ser Gly Cys Thr Thr Cys 

255 260 265 

GTC GCT GGC AGC ACC TGC ACG AAG ATT AAT GAC TGG TAC CAT CAG TGC 1085 
Val Ala Gly Ser Thr Cys Thr Lys He Asn Asp Trp Tyr His Gin Cys 

270 275 280 

CTG TAA ACGCAGGGCA GCCTGAGAAC CTTACTGGTT GCGCAACGAA ATGACACTCC 1141 
Leu 

CAATCACTGT ATTAGTTCTT GTACATAATT TCGTCATCCC TCCAGGGATT GTCACATATA 1201 
TGCAATGATG AATACTGAAC ACAAACCTGG CCGCTTGAAC TGGCCGAAGG AATGCC 1257 

R?'J©S$ : 1 6 
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*£M5=i : Humicola insolens 

Gin Asn Cys Gly Ser Leu Thr Thr Glu Arg His Pro Ser Leu Ser Trp 
15 10 15 

iffiWSS : 2 0 

%M%x : Humicola insolens 

Val Val Glu Glu Arg Gin Asn Cys Gly Ser Ala Asp Gly Lys Ser Thr 
1 5 10 15 

Arg Tyr Trp Asp 
20 

I£?iJ©^£ : 2 1 

tarn 
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: Humicola insolens 

Gin Asn Cys Gly Ser Ala Asp Gly Lys Ser Thr Arg Tyr Trp Asp Cys 

1 5 10 15 

Cys Lys Pro Ser Cys 

20 

SE?iJ#-s§-7 
E5*J©fi$ : 1 6 

h*ov>- : m.mvt 
mm 

: Humicola insolens 

E3WJ 

Gin Gin Ala Gly Ser Ala Asp Gly Lys Ser Thr Arg Tyr Trp Asp Cys 
1 5 10 15 
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1. y i 3-5 • 4 >VU>74*Ot;l/7-^NCE 1 ite^tfefiNC 

E2itfe : F©$iJtSli2W. ^7; KpM3-l*fctt?*5X5 KpM14-l 
ftfc& 5 N C E 1 fc«fctf 2 0*dfflIE^J"T?*S> ffi#^ 1 i£M<0&m<? ? -o 

3. lufe^JftlEW^ ^nt-^-, '>r^-;l/SE^J> fccfctf^-i;*-^- 

4. lulB^n^-^-SB^J^ K P M 3 - 1 
0NZ3&frS>±m<Dmi 5 0 0 b p iT'O^^I^St^E^ttlii^Dt 

5. NCE l*fi^©N5faS^£±ffi©*&l 5 0 0 b p*T©ffi«*lcfipfiE 
-£-<5EW> ^57; KpM3 - 1 4*£>NC E imGrP<DN3&#frP>±ffi<DB g 

hum h*-e©E?ijT*5> m$tm4mm<D&m'<? ? -o 

6 . lulB^n ^e- 9 -E3W> h'pMl 4 — 1 N C E 2 *t£ 
^©N3^^£±ffi(D*4jl 5 0 0 b p*T?©«l«*C??^t--5S£J'Jt^«i^n 

7 . NCE2 itfe^ <£ N3fc3^ £_t&f£CD& I 5 0 0 bp* T?©fig«4»l=#ft 
t«W> "/7X; KpM14-14>©NCE2 itfcT© N5fcjj*}^ £±i5?t© E 

8 . mre •> r ;ue e^j#-§- i \zmm® r i j mmmo -22*^- 

1 £ "COE^i* - StSE^iJ L < \tmm^ 2 (cfB$&<D TU 1?E?»J® 
-23i^-lt T?fflE?i|* 3 - Kf - aJSSEyik * li C ft t>&SE?»JCD&^ 
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9. iE^-i^-^-W, ^77;KpM3-l*®, NCElitfc 
^©C^g^t.T^O^il 4 0 0 b p^"C©®«4itc#^f SBE^Jtfctt^- ^ 

10. NC E ia^©C5fc^STffi©&l 4 0 0 b p £T•cD^^I^ite 3 l;:# 
^E1f-Si^W, N C E 1 afi^© C3W*&T«M> B g 1 IIiM h *TO^|-c 

1 1 . iiutE* - S * - * -W, ^7X; K p M 1 4 — 1 t£<D N N C E 2 

afe^oc^as^(?>T^o^5 o o b P £r-<D$MmtpizftiE-f zm?<\*tzi±? - 

5 * - * -«*6*fiHW « * © 3teC^jT?* -5 . 3 !E«©f£3i^ ? * - c 

12. N C E 2 m&fr<D C 6T*0» 5 0 0 bp| -evimfrKm 
■fZmmU NCE'2slfe?©C*SB^6T«EOBg lil-*-f h *-C©K3?iJ-e& 

5. it 1 1 tamofgii^ * * - e 

14. iiflS2§fi<l*>'*?«a< % t;l/7-4fNC E 4 ifcliftib©^ > 

1 5. ae^-a-fcetc^-ett^ 4®^^-^ 

1 6 . afc-?^-* -*<««wttae^"e* -5. §* 1 5 (cib^cd^k 

1 7. it£?v- # — ^Streptomyces rimofaciens&^cD-f^ h T-f >jRnf 
m&fc=£T'& If ^ 1 5 KfS«6D#£gp< ? * - c 

18. M^^^-pMKDOl, pEGDOU *^«pIED0 2 o 
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1 9. awssi o~i 8©v^-rn*>— ^i2«© / <^^-i-«koT0m£«i 

20. y i ^-vmizm-tzw&Va&y i=i-? • j > v u m 

2 2. ^ 3-5 • >VU>7T-*^ ^©igSf&l 'J >v 

fct) 4. 5 gEl_t<0* 8**^2 lie«S©^i4o 

2 3. fff^l§2 l*fcl*2 2 lztZm®Jj&lz&r>T£.mZtltz* 

2 4. i&£^2 13 fcii 2 2 Kfe*B©#&K<fcoT. ^^^-iLt^ 

25. tt*&2 0£tzi*2 l(cfBig©Srj*^ckoT, ^^^-ilt% 

2 6. »#3K2 l*fctt2 2(cfa«®^{c«J;oT. 
^K^^-p I ED 0 2*ffl^fc»^*=»5»n«5^ N5^j«<ciE?iJ»-%7«cSStt© 

27. KpM3-l N C E 1 N*JB*» £±»fcCD£j 1 

5 0 0 b p*T0^*lcSSt5E?tJifcliil7 , Dt-^-SM^t5^ 

2 8. N C E 1 il^ON^JS^ £±3fe©&J 1 5 0 0 bp* T©flW*K# 
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/7X2KpM3-l*ONCEl »fc*©N*»*S±*© B 

2 9. ^XiKpMH-l*®, NCE2«WON3Wi^6±«©» 
1 5 0 0 b P*Tf©«*KMt-*B3»J*fcJ4K^o*~^-^*fl lJW(6 

3 0. N C E 2«fe?ON5|c«^e > ±jfE©««J 1 5 0 0 b pl^Ofi^cff 

KpMl 4-l*©NCE2it6?F©N3Wi^6±*© 
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